Urban street networks: a comparative analysis of ten European cities 
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We compare the structural properties of the street networks of ten different European 
cities using their primal representation. We investigate the properties of the geometry 
of the networks and a set of centrality measures highlighting differences and similarities 
among cases. In particular, we found that cities share structural similarities due to their 
quasi planarity but that there are also several distinctive geometrical proprieties. A 
Principal Component Analysis is also performed on the distributions of centralities and 



their respective moments, which is used to find distinctive characteristics by which we can 
classify cities into families. We believe that, beyond the improvement of the empirical 
knowledge on streets network proprieties, our findings can open new perspectives in the 
scientific relation between city planning and complex networks, stimulating the debate 
on the effectiveness of the set of knowledge that statistical physics can contribute for city 
planning and urban morphology studies. 
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I. INTRODUCTION 

Defining urban form is certainly an important and difficult issue, especially if one wants to 
supply useful knowledge to urban planners and urban designers or new knowledge for city scien- 
tist. In this paper we address this question, and we try to improve the empirical-based knowledge 
on the structure of a city by studying the urban street networks of ten European cities, namely 
Edinburgh, Leicester, Sheffield, Oxford, Worcester, Lancaster, Catania, Barcelona, Bologna and 
Geneva. The form of cities is the subject of an area of urban studies named urban morphology. 
Urban morphology in its current form emerged between the 40s and the 60s of the XX Century 
from the work of two scholars as prominent as different: the German-born and then British urban 
geographer M.R.Conzen (1960), and the Italian architect and historian Saverio Muratori (1960). 
In this area, the main subject of investigation is the urban fabric of the city at the scale of the 
neighborhood, street, plot and building. 

A different branch of urban morphology has stemmed from the sciences of complex systems 
building on a long standing tradition in regional analysis, economic geography and modeling 
HAnas 1998H . Complexity in the built environment is here investigated with the same instruments 
used for other classes of self-organized phenomena in nature, technology and society JBatty 200? |. 



These works are now flanked by a growing interest on complex spatial networks within the com- 
munity of physics HBoccaletti et al. 2 0061. 

Spatial networks and in particular planar graphs are suited to model a number of real phenom- 



ena [Barthelemy 201 1 1. We are here interested in the study of a particular class of spatial networks 
that describe the street pattern of cities. The beginning of these studies can be traced back to the 
classical works on regional transportation networks based on graph theory [Garriso n et al. 19621 
Kanskye^a/. 1989 1. The advent of complex system science and its paradigm HBarabasi 1 999, 
Barabas i 200211 . jointly with the increasing availability of spatial and time geo-referenced data, 
has given a new boost to these studies, and several important contributions have appeared recently 



IBarthele mylOTT] IStrano et al. 20121 . 



In UMasucci et al. 20 091, Masucci et al. study the structural property of the London street net- 



work in its dual and primal representation. In [Jiang 2007 1, the autors, by using 40 urban networks 



in a dual representation, found a small-world structure and a scale-free property for both street 
length and connectivity degree, and used various centrality indices as indicators of the importance 
of streets. Lammer et al. developed a comparative analysis of the betweenness distribution in 20 
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cities in Germany, suggesting a relation with vehicular traffic ILam mer et al. 20061 . Others have 
focused on centralities in primal and dual representations of street networks UPorta et al. 20061 
IPorta et al. 20061 IPorta et al. 20061 ICrucitti et a/."^006l IHillier et al. T9841 IHillier 1999H . and on 
other structural features, such as the number of cycles of a given length [Cardillo et al. 20 061. 

However, an important and still open question in urban morphology has to do with the char- 
acterization of classes of cities based on their form. This is a preliminary step to approach a 
comparative analysis aimed at the classification of cities. In this paper we propose a classification 
based on the distribution of street centrality by cross-comparing real cases and therefore with no 
use of null models. We limit our study to the characterization of city form without exploring its im- 
pact on collective behaviors, an area of research which, at the scale of entire cities, is now finding 
new opportunities through the exploitation of massive datasets from online geo- social networks 
and mobile geo-referenced systems URatti et al. 20101 [Expert et al. 201 lp 

In our study, we first observe the geometry of the networks following the approach recently 
suggested in HChan et al. 201111 : we consider the distribution of basic indices of the primal street 
networks such as street angles at intersections and street lengths. Secondly, we investigate four 
different centralities indices computed over the entire urban street network, highlighting their dis- 
tribution and their mutual correlation. 

We considered 10 European cities namely Edinburgh, Leicester, Sheffield, Oxford, Worcester, 
Lancaster, Catania, Barcelona, Bologna and Geneva. We show that these cities share some struc- 
tural geometric properties, which are the same found in other planar spatial systems such as those 
of leaves UPerna et al. 20111 ICoudergf a/. 20021 IBohn et al. 20021 , which suggests that planarity 
in itself is a driver across spatial systems in various domains. 

However, we also show that cities are different. We witness that some cities, like Catania, 
stand alone in terms of basic geometric properties and, more importantly, that the various dis- 
tributions of centrality are reconcilable to one common pattern (a power law) only if a largely 
minoritarian subset of streets are taken into consideration. These results have to do with the ex- 
treme heterogeneity of the cities' visible traits as resulting from the interplay of entirely different 
phenomenon in time, such as historical accidents (including planning), physical constraints or just 
random events [Batty 2005 1. In particular, we show how cases tend to cluster in groups after the 
whole set of centrality measures distributions are considered in a way that suggests major planning 
interventions and physical geographic constraints are key. 

The paper is organized as follows. In Section|n]the case studies are presented and their geomet- 
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ric properties are introduced and analyzed. In Section III the study of the four centrality indices 
is illustrated along with the clusterization of cases according to their combined behavior as a re- 
sult of the application of a Principal Component Analysis (PCA) of the distribution of those four 
centralities. Finally in Section [TVlwe offer a discussion of the results and our conclusions. 



II. BASIC PROPRIETIES OF URBAN NETWORKS 



We address the analysis of 10 European cities, namely: Edinburgh, Leicester, Sheffield, Oxford, 
Worcester and Lancaster (UK); Catania and Bologna (IT); Barcelona (ES); Geneva (CH) (Fig. [T]). 
These cities are variously located and present remarkably different economic, cultural and climatic 
conditions along with great variety of characteristics like population and size (Tab. [I]). We represent 
the street network of these cities such that intersections are nodes and streets are links between 
nodes, i.e. a primal representation UPorta et al. 20 061. We analyse such networks in terms of their 
basic properties and several geometric indices such as street length and the angle they form at 
intersections. 




FIG. 1 . Geographical location of the cities. 



Before getting deeper into the analysis, we introduce some basic concepts of graph theory. A 
graph (or network) is a mathematical object which consists of two sets: N and C. The N elements 
of the former are called nodes, while the E elements of the latter (unordered couple of nodes) are 
called links. 

There are many ways to represent a graph, but, the most common one is the adjacency matrix A, 
a N x N square matrix whose entry = l,...,N) is equal to one if link between nodes % 



6 



and j exists and zero otherwise. The degree of a node i, hi, is the number of links incident with it. 
The average degree (k) = 2E/N, is nothing but the average of the degrees over all the nodes in 
the network. 

Networks of street patterns belong to a particular class of graphs called, planar graph, i.e. 
graphs whose links cross only on nodes. In our case, the nodes represent street intersections, 
while the links are the streets centerline, a network made using this convention is called primal 
network (Fig. [2]). Our networks are also weighted and each link carries a numerical value u>y 
expressing the intensity of the connection. The natural choice, in our case, for the functional form 
expressing the weight of a link connecting nodes, say i and j, is to put Wij equal to the length of 
the street connecting, 1^. 




FIG. 2. Example of street networks, Leicester (Left) and Worcester (Rigth). The streets with length that 
are not following a power-law distribution are shown in black, while streets with length that follows a 
power-law distribution are shown in grey. 

Our analysis starts by importing the street system into a Geographical Information System 
(GIS) environment. Data of the street systems have been retrieved from different sources: for 
example, in all UK cases we have used the Ordnance Survey maps, while in Italian cases we have 
used data from City Councils planning offices, and in Barcelona we worked on a dataset provided 
by the Agenda d'Ecologia Urbana de Barcelona. Given that these geographical street networks 



7 



had been mainly built for the sake of traffic navigation or planning, they presented characteristics 
that not always fitted the purposes of a centrality analysis; for example, multi-lane streets were 
usually represented with one link per lane rather than one link per street. As a result we have 
prepared our databases by first cleaning the networks accurately to remove link redundancies, 
fix short missing links, collapse unconnected links on the same node when needed, continuously 
confronting the networks with aerial images of the real cities drawn from remote sensing sources 
such as Google Earth. Such procedure was undertaken both manually and through ad-hoc tools 
in a GIS environment. For the definition of the boundary of the urban systems, we followed the 
border of the built-up area extending it by roughly 1km outbound. 

Considering the entries of Tab. |I} we can see how various selected cities are for example in 
terms of size, from small cases like Lancaster to large ones like Barcelona, or in terms of street 
intersection density, from very dense cities like Catania to more sprawled ones like Edinburgh. 
We have selected cities with different levels of geographic constraints, from those like Geneva 
and Oxford traversed or limited by large natural water features to those like Catania and Bologna 
that sit on uninterrupted plains, and a different prominence of planning history, from those self- 
organized or only fragmentarily planned like Leicester or Bologna to one like Barcelona whose 
street layout had been heavily determined by one single planning vision (the 1859 Cerda Plan). 
We see that the particular planning history of Barcelona is reflected in the low values of both the 
standard deviation of the street length and the percentage of dead-end streets on the total number 
of streets, both resulting from the extensive adoption of a rigid homogeneous grid layout. The 
extreme diversity of selected cases has been pursued in order to make the comparative analysis of 
similarities and differences more profound. 

The study of the geometric properties of the networks has been focused on the distribu- 
tion of three indices with the aim to find common patterns: street length, angles formed be- 
tween street intersections, and the relation between dead-end link length and the area of the 
cycle they belong to. Following the definition given in HChan et al. 201111 . we consider cy- 
cles as polygons formed by closed loops of links. We build our approach on previous find- 
ings that have identified universal geometric patterns under seemingly diverse street networks 
in cities IIMasucci et al. 20091 |Barthelemy et al. 2008| ILammer et al. 20061 IPerna et al. 20111 
ICouder et al. 20021 IBohn et al. 200211 and extend our exploration to focus on local patterns that 
actually make for the uniqueness of each case or clusters of cases. 

In a city both long and short streets play an important role. The former allow connecting distant 
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City 


Population 


Area (km 2 ) 


N 


E 


(k) 


pCkm- 1 ) 


L (km) 


{£) (m) 


(Ji (m) 


/(%) 


Barcelona 


1,615,908 


82.0 


6452 


11071 


3.43 


15.15 


1242 


110.7 


105.1 


0.1 


Bologna 


380,878 


88.6 


5200 


7359 


2.84 


9.19 


814 


119.1 


158.0 


23 


Catania 


293,811 


34.0 


11099 


14039 


2.52 


23.91 


813 


55.9 


57.3 


17 


Edinburgh 


477,660 


195.6 


5021 


13063 


2.43 


8.96 


1752 


110.0 


147.0 


24 


Geneva 


191,237 


95.0 


6183 


8681 


2.80 


11.18 


1062 


122.4 


119.7 


0.9 


Lancaster 


45,952 


77.7 


5913 


15567 


2.51 


9.28 


721 


96.8 


153.6 


18 


Leicester 


288,000 


122.2 


7186 


8896 


2.47 


7.23 


883 


98.5 


94.5 


18 


Oxford 


149,800 


51.1 


4372 


11071 


2.32 


10.27 


525 


103.0 


133.7 


28 


Sheffield 


520,700 


187.5 


14583 


17674 


2.42 


10.58 


1983 


111.0 


129.8 


21 


Worcester 


94,300 


45.1 


4685 


5538 


2.36 


11.75 


530 


94.8 


126.6 


21 



TABLE I. Basic proprieties of the primal networks. N and E are the number of nodes and number of edges, 
respectively, while (k) is the average degree. The density p is given by the ratio between the total length 
(L) and the Area. (£) indicates the the average length of the edges, while and ae corresponds to its standard 
deviation. / indicate the percentage of tree-like appendixes. 



locations, the latter act as shortcuts between longer streets reducing the average path length in the 
navigation of the system. Short and long streets have a different historical meaning in the evolution 
of cities, as street length tends to diminish with increasing density of the urban pattern, following 
a rule according to which the more "urban" the area the shorter the street. 

The overall average street length (£) of selected networks represents a simple and good indica- 
tor of the diversity of cities, and looking at Tab. [I] we find a considerable variation of such average 
that, even putting aside the special case of Catania with (£) = 56m, which is due to the extreme 
density of the historical core, lays between 94.8m (Worcester) and 122.4m (Geneva). 

We have then observed the distribution of street length in the selected cases. Since we are 
comparing cities with different size, we have considered the normalized length £, i.e. the street 
length divided by the diameter of the network defined as the maximum Euclidean distance between 
any couple of nodes belonging to the network. 

In Fig. [3] we confirm the findings of [Barthelemy et al. 2008 1, that the relation between the 
total length L and the number of nodes N scale as yN. However, we notice that our cases are not 
closely distributed along a straight line, indicating a significance variance that can be explained by 
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the different nature of our dataset: we are in fact comparing a smaller number of cases; moreover, 
our cases are large networks representing entire cities, which means that we are here handling non 
homogeneous and invariant street networks made of parts derived by different historical formations 
and shape. It is exactly this variability that we want to capture with a closer look at the differences 
emerging from the data. 
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FIG. 3. Total street length L as a function of v iV 



The distribution of streets' length is the simplest and first indicator that we use for changing 
such differences. Previous findings proposed a power law distribution with a cut-off for the longest 
street segments [Ma succi et al. 20091 and a bimodal shape distribution with a plato region above 
30m HChan et al. 201 11 . We are finding here slightly different results. If we look at Fig. [4^, we 
see that a power law emerges in the distribution tail though, of course, the accordance between the 
distribution and the fitting became worse with decreasing of the streets' length. However, since we 
want to look at local patterns as well as global, we want to investigate what happens in the region 
that is not well fitted by a power law. 

In Fig. |4j), we plot also the same distribution in a semi-log scale. Here it is possible to see 
that most of the distributions exhibit a peak in the region between 8 x 10~ 4 and 3 x 10~ 3 , which 
means that the majority of streets have a normalized length around that values. However Catania, 
Barcelona and Worcester have different behaviors whose causes might be traced back to historical 
accidents. Catania falls out of these boundaries because of its abundance of very short streets 
in the historical urban center, possibly a consequence of the complete reconstruction of the city 
center the disastrous eruption of the Etna Volcano in 1669. Barcelona presents a anomalous peak 
clearly related to the Plan Cerda, mentioned above, a massive grid-iron plan covering the central 
part of the current urban area. Worcester exhibits a double peak in the length distribution, a 
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consequence of the post XXII planning process that clearly shapes the most of the periphery. From 
this simple recollection we can appreciate the impact of specific historical occurrences that mostly 
have impacted on the urban form in terms of an interplay between planned and the non-planned 
urban forms. 




length / diameter length / diameter 

FIG. 4. Distribution of normalized length I for the ten cities in semi-log scale (right) and for two repre- 
sentative cities, Catania and Edinburgh, in the log-log scale (left). It is possible to observe how different 
visualization give different results. While in the semi-log plot the distribution of the majority of streets seem 
different, in the log-log they have the same trend. 

This variety of street patterns gets entirely hidden in the conventional representation of data 
through log-log charts and if we look at what part of the street layout falls within the region that is 
correctly fitted by a power law function Fig. [4^, we see that it actually represents a vast minority 
of the entire network in all our cases. 

In terms of sheer number of streets, using the method proposed in HClauset et al. 20091 we 
observed that the percentage of streets falling inside the power law region ranges from 4% in 
Barcelona to the 29% in Lancaster as shown in Table |Il| Of course, streets falling in the power 
law region are the longest, so they cover a larger share of the system in terms of street length; 
however, even so, the percentage of the total street length belonging to streets falling outside the 
power law region in most of cities it is up to the 60% as shown in the last column of Table |n} 
We can appreciate visually the geographic consistency and character of this left over from power 
law portion of the street network in Leicester and Worcester in the Fig. [2j clearly, this portion 
represents not only the majority of the street network, but also the part that is historically more 
important, the denser and the more central, which is no surprise if we think that it is made by the 
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shortest streets however generalization and conclusions may be improved by further and deeper 
tests on larger areas. For the time beeing, based on this resutls, we may argue that cities are 
composed by streets following two distribution and that it may reflect different dynamics of urban 
evolution. 



City 


Threshold (m) 


% of streets in a Power Law distribution 


% of L not in the Power Law 


Barcelona 


347 


4 


86 


Bologna 


122 


28 


37 


Catania 


143 


6 


77 


Edinburgh 


194 


13 


59 


Geneve 


218 


13 


63 


Lancaster 


93 


29 


32 


Leicester 


211 


9 


74 


Milano 


233 


8 


71 


Oxford 


184 


13 


58 


Sheffield 


245 


8 


70 


Worcester 


166 


14 


55 



TABLE II. The total amount of streets are following a Power Law distribution is very low and includes only 
the longer streets of the city. In terms of total street length the percentage of street are not following the 
Power Law distribution are always the majority with the exception of Lancaster. 

Streets are not always straight lines. In order to study the distribution of angles formed by 
streets at intersections we must use an equivalent network in which all the streets are represented 
by straight lines (i.e. substituting the curved streets with straight ones) and where the link weight 
is equal to the Euclidean distance between its end nodes. We name this network Euclidean net- 
work. As for all the generalization models, results should be interpreted with caution because of 
the effect that the approximation may produce on the real structure of the network. Such caution 
suggests a preliminary test. The inset in Fig. [5] shows small divergences between the distribution 
of street length in the original and the Euclidean networks for the city of Leicester. Such sim- 
ple test confirms that the Euclidean generalization does not lose relevant information, i.e. that 
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streets in cities are not always straight, but predominantly so, confirming the finding provided in 
HChan et al. 20111 . Therefore we are confident to study the distribution of the angles formed by 
street intersections in the Euclidean network as a reliable approximation of the original. The dis- 
tributions of the angles formed by streets at intersections are shown in Fig. [5j At a glance, we 
note that all the cities share the same behavior exhibiting a double peak shaped distribution around 
the characteristic values of 90 and 180 degrees, respectively. This finding confirms the analogous 
results that have been found in other kinds of spatial planar networks, like those of leaf venations 
HCou der et a l. 2002UBohn et al. 200211 . as a result of tensorial stress fields or simple force models. 
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FIG. 5. Distributions of the angles formed by street intersections of the cities' Euclidean networks. All the 
cities show the same shape. The inset shows the street length distribution of the original network and the 
Euclidean one for the city of Leicester. 



The incidence of certain motifs in complex networks is a long standing object of investiga- 
tion, IM ilo et al. 2002II . Here we want to focus not on the cycle's shape and quantity, but on the 
relation between the total length of dead links within a given cycle and its area. Even if we are 
measuring static systems, i.e. systems that do not change in time, we should remember that a 
cycle is the result of an evolutionary process that starts with short dead end streets sprouting from 
the longest edges of the cycles and then extending towards the opposite edge until splitting the 
original cycle in two smaller sub-cycles. Dead end streets can be interpreted as sprouts of new 
cycles in parcels still subjected to evolution or as crystallized fractures that do not undergo further 
development. Their quantity is given by the index / shown in Tab. [j] and it can be thought of as 
an estimator of the abundance of cycles in the intermediate evolutionary stage of their lifetime 



as suggested in [Barthelemy et al. 2008 1. Such assumptions are supported by the result shown in 
Fig. [6j where we report the sum of the length of dead links inside each cycle versus the area of 
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the cycle itself for each city. The distribution shows a clear power-law behavior with a common 
exponent close to 0.8. It is worth to note that the power-law behavior is not affected by any fac- 
tor like the fraction of dead ends /, or the average degree (k) . Similar results have appeared in 
ULammer et al. 20061 IPerna et al. 20 111 . Of course, our findings can be truly confirmed only by 
investigating the evolution of urban streets in time with the support of empirical data. 




10 3 10 4 10 5 10 6 

area (m 2 ) 



FIG. 6. Total length of trees like (dead ends) links as a function of the cycle area for the considered cities. 
The distributions show a power-law behavior with an exponent around 0.8. 

HI. CENTRALITIES AND CITY CLASSIFICATION 

The concept of Centrality has been used for many years in network and social science and, 
starting from the seminal work by Wasserman [Wasser man et al. 19941 , there has been a growth 
of literature both regarding centrality on social networks as well as other kinds of networks. De- 
pending on the definition, centrality can be understood as meaning proximity between nodes, 
accessibility from other nodes, or being in a strategic position for connecting couple of nodes. It is 
clear that from different definitions of centrality, a node actor can be placed at different centrality 
ranks and that the same node can result with high value for a centrality while yielding weak values 
for another one. Therefore, for different cities, we can reasonably expect slightly different dis- 
tributions of centralities. Moreover, we can identify how centralities are mutually dependent and 
correlated. What makes centralities particularly suited for geographical studies is that they can be 
visualized and mapped. We are interested in understanding if and how these ranges of correlations 
and fluctuation can help to classify cities that share the main network morphology. For example, a 
grid-like network can be different from a radial one, but only looking at the statistical distribution 
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of centralities might not be enough for a proper classification. This problem has been already 
analysed by Crucitti and Porta UPorta et al. 2006H where the statistical distribution of Closeness, 
Betweenness, Straightness and Information centralities have been analysed for a sample of twenty 
city parcels of one square mile. Crucitti et al. found significant differences between cities and 
through a cluster analysis they proposed a classification of different urban patterns. The approach 
proved to be effective in capturing essential features of urban form as emerging in limited samples 
selected for their inner morphological consistency, but dealing with entire cities poses the problem 
of the classification of internally complex objects predominantly composed of different parts each 
possibly exhibiting different properties. So, the question about the validity of such a procedure on 
a whole city still need a response. In order to validate if centrality indices can be used in the clas- 
sification of entire cities, we propose here a clustering method based on the Principal Component 
Analysis (PCA) made on the distribution of centralities and on their moments. Before discussing 
this part of the research, we must introduce briefly the adopted centrality indices. 

Betweenness centrality, C B , is based on the idea that a node is more central when it is tra- 
versed by a larger number of the shortest paths connecting all couple of nodes in the network. More 
precisely, the betweenness of a node i is defined as in HWasserman et al. 19941 Freeman 19771 



where rijk is the number of shortest paths connecting j and k, while rijk{i) is the number of shortest 
paths connecting j and k and passing through i. 

Straightness centrality, C s , originates from the idea that the efficiency in the communica- 
tion between two nodes i and j is equal to the inverse of the shortest path length, or geodesic, 
dij ULatoragf a/. 200 111 . In the case of a spatial network embedded into a Euclidean space, the 
straightness centrality of node i is defined as: 



where d is the Euclidean distance between nodes i and j along a straight line. This measurement 
captures to which extent the connecting route between nodes i and j, let's say between each street 
junctions, deviates from a virtual straight route. 

The Closeness centrality , C c ,of a node i is based on the concept of minimum distance, in 
topological sense, i.e. the minimum number of edges traversed to get from i to j 



Freeman 19791 : 




(1) 




(2) 
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IIBoccaletti et al. 200611 and is denned as in llScott 20001 ISabidussi 1966ll : 

C? = - = — , (3) 

where L$ is the average distance from i to all the other nodes. Closeness centrality is a classical 
centrality index that has been widely used in urban geography and econometrics as well as in 
regional planning, where it gives an idea of the cost that spatial distance loads on many different 
kinds of relationships that take place between places, people, activities and markets. 

The Accessibility C A is a measure recently introduced by Travencolo and Costa 
UTravencolo et al. 2008H . It has been used for studying the property of very different spatial net- 
works. In the case of urban networks, it has been used to investigate the relationship between 
subway and road systems Hda F. Costa et al. 201 111 . In addition, the accessibility has been found to 
be closely related to the borders of networks [Tr avencolo et al. 200 91. in the sense that nodes with 
low accessibility tend to belong to these borders. The C A of node i measures the ratio of neigh- 
bouring nodes that are effectively reached by an agent randomly navigating the network against the 
actual number of nodes that belong to the neighborhood. More precisely, C A takes into account 
the number of nodes effectively accessed by each node of the network, as well as the probabilities 
of such accesses. First, we evaluate the transition probability Pij(h) which describes the proba- 
bility for an agent leaving from node i to reach node j after h steps along a given type of walk. At 
each step, the agent located at node q, chooses a random neighbor of q and jump to it. These rules 
define a random walk over the network. When the transition probabilities are very heterogeneous, 
we have low values of accessibility, meaning that the random walks are biased toward a certain 
number of nodes, which is, less than the number of nodes which can be reached after h steps. On 
the other hand, when the transition probabilities are homogeneous, all nodes which can be reached 
after h steps are accessed, on average, the same number of times. This case corresponds to the 
highest values of accessibility. The heterogeneity of the transition probabilities is quantified in 
terms of the classical concept of entropy, so that the mathematical definition of accessibility, of a 
node i, with respect to the number of steps h is given as 



C A (h) = exp 



N 

"£PiAh) log 



(4) 



Also, we have considered the transition probability for unitary step (h — 1), as: 
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where Wij is the weight of the edge In order to take the geography into account, we consid- 
ered Wij = l/df" cl . For disconnected nodes, we assume df" cl = oo such that w^j = 0. 

In order to investigate if the distribution of centralities can describe the main geographical dif- 
ferences within cities, we use the PCA approach. This well-established method of multivariate 
statistics implements a projection of the distribution of objects (in our case, cities) from a higher 
into a lower dimensional space such that the maximum dispersion of the data is observed at the 
first new axis (or principal variable), and so forth. This projection is optimal in the sense of fully 
decorrelating the original data, therefore removing all correlations between the original measure- 
ments describing the objects. So, since the data dispersion is better described by the first principal 
axes, the remainder axes can be discarded. PCA is therefore particularly relevant to the presents 
studies because: (i) it decorrelates the original measurements; (ii) it provides a projection of the 
data that maximizes their dispersion (i.e. the differences between the cities); and (iii) it allows the 
visualization of the distribution of objects (when projected into 2D or 3D spaces). 

The PCA consists in obtaining the covariance matrix of the original data and then extracting 
its eigenvalues and respective eigenvectors. The eigenvalues can be shown to correspond to the 
variances along each new axis, and the each respective eigenvector component provides the co- 
efficient of the linear combination of the original measurements used to project the original data 
into the respective axis. Therefore, the effectiveness of PCA in projecting the data can be inferred 
by inspecting the eigenvalues in descending order. For instance, if the two largest eigenvalues 
account for 75% of the overall variance, it can be understood that these two axes are describing 
the original distribution of points in an effective way, and that the other axes can be overlooked 
without losing much information. 

First, we evaluate the histogram of each centrality measurement considering 20 bins. In this 
way, each dimension corresponds to the relative frequency of centrality values in a small range. 
The histogram of each one of the four centralities were merger together in order to create an 
eighty dimension feature vector for each city. For instance, Figure [7] (top) shows the feature vector 
obtained for the city of Leicester. 

In the second approach, the feature vectors were created by considering the 20th first moments 
of the centralities distributions merged together, also generating an eighty dimension vector. It 
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bins 

FIG. 7. The complete vector centralities and the contribution of each bin to the explanation illustrated in 
Fig. E 

is important to note that, both first and second approaches provided similar results. Finally, the 
dimension of the feature vectors were reduced from eighty to two, in which each original dimen- 
sion has contribution according the values shown in Figure [7] (bottom) and Fig. [7] (top) shows the 
explanation of each dimension. By plotting the first two dimensions, it is possible to account for 
almost 50% of the dispersion, while a clear differentiation between cities can be seen in Fig. [8} 
Since it is impossible to visualize the high-dimensional data so as to try to recognize the clusters, 
we used an agglomerative clustering method, the so-called complete linkage method to perform 
this task. In this method, the Euclidean distance between two clusters is given by the value of the 
shortest distance between any object belonging to these clusters. The final result is shown in Fig. [9] 
as a dendrogram, where the colours correspond to the two clusters identified by using a threshold 
parameter 0.7 of the maximum distance between two cities. 
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FIG. 8. PC A using only Straightness and Accessibility 
This figure clearly shows the emergence of one cluster of cities (grey color) and of a group 
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FIG. 9. The dendrogram is based on the agglomerative complete linkage method on the 12 descriptive 
components illustrated in Fig. 8. The dendrogram suggest the existence of two main families of cities that 
are basically with and without big geographical constraints. The only exception is given by Barcelona, that 
even without a natural constraint was subjected to a massive and strong planning operation. 

of cities very different each others but still belonging to a single family (black colour), so we 
may intepret this agglomeration as the existence of two main different classes or typologies of 
cities. In this classification a clear group composed by Catania, Bologna, Sheffield, Edinburgh 
and Leicester is separated from Geneve, Lancaster, Oxford, Worcester and Barcelona. Though the 
reasons behind this separation may be manifold and it needs further experimental tests, we notice 
that comparing the plans of those cities and the topography of their terrain it is possible to conclude 
that physical constraints can be a key factor to classify cities. We have found that cities traversed 
by rivers or bordered by a lake were separated from those that the growth has never been boundend 
or divided by physical constraints. The exeption of Barcelona, a city with a major planning events 
(the Plan Cerda) confirms this hypothesis posing the great urban planning operations in the list of 
the geographical constraints. 



IV. DISCUSSION 



In this research we analyse the street network of 10 European cities represented in a primal 
way, where intersections are translated into the nodes of a network and the connecting streets into 
the links. We study first the geometric properties of the networks and then the way centrality is 
distributed over them according to four different definition of centrality. We show that selected 
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cities share several universal geometric patterns, such as the average street length (which is lim- 
ited within a remarkably narrow range of values), the distribution of angles between streets at 
intersections, and the distribution of the total length of dead end streets as a function of the area 
of the cycle they belong to. In addition, we confirm that the distribution of street lengths follows 
universal power law behavior in the long tail of the dataset, i.e. for high values of street length. 
However, we highlight that the conventional way to represent the distribution of street lengths in 
search of the power law behavior leaves out of the picture the vast majority of the data, i.e. the 
number of streets whose length distribution cannot be accurately fitted by a power law function 
is in the order of the 90% of the whole dataset. We therefore investigated the actual behaviour 
of the street length distribution in a semi-log scale finding the emergence of remarkably different 
behaviours that seemingly reflect the diversity of local history and conditions. Finally, we found 
that the distribution of the four centrality indices over the street networks allows us to clearly 
characterize two different clusters of cities that appear to be predominantly informed by major 
topographic and geographic local features like the presence of rivers or lakes. We also found that 
planning events of extreme magnitude like the occurrence of the Plan Cerda in Barcelona may 
account for the uniqueness of this city as measured through the distribution of street centrality. 

We show how the analysis of simple geometric properties of the street network as well as a 
more complex evaluation of the street centrality can highlight differences between cities and how 
these differences can be used for classifying cities in categories. It is clear that global geometric 
characteristics of the street network exhibit "universal" behaviors but it is reasonable to argue 
that, as these behaviors equally emerge in a vast range of transportation networks in nature and 
technology that like the street networks are mainly planar, the universality of those features may 
be related to the planarity of the systems rather than a particular "nature" of cities as such. 

Regarding the analysis of the morphology of cities, we believe our study highlights that if 
the universal rules governing the evolution of planar hierarchical networks are not discriminated, 
the extreme differences of their inner urban structure may be easily underestimated and, from 
a qualitative point of view, it may leave local patterns out of the picture. We focused on this 
discrimination process showing how to operate at global level in order to highlight local patterns 
where the extreme diversity of our cities emerges and should be accounted for, especially when 
we move on to the problem of classification, i.e. that of finding a taxonomy for urban types. 



20 



A. Acknowledgments 

The authors thanks Shibu Raman, Tim Jones, Mike Jenks and Colin Pooley for providing the 
data of some of the analyzed cities. Emanuele Strano is grateful to Marc Barthelemy, Salva- 
tore Scellato and Andrea Perna for their suggestions and help and to the Faculty of Engineer at 
University of Strathclyde for his MRes grant. Luciano is grateful to FAPESP (05/00587-5) and 
CNPq (301303/06-1 and 573583/2008-0) for financial support. Matheus Viana thanks FAPESP 
(2010/16310-0) for his PhD grant. 

B. References 



[Anas 1998] Anas A, Arnott R, Small K A, 1998, "Urban Spatial Structure", Journal of Economic Litera- 
ture, 36 1426-1464 

[Banavar et al. 1999] Banavar J R, Maritan A, Rinaldo A, 1999, "Size and Form in Efficient Transportation 

Networks", Nature 399 130 
[Barabasi 2002] Barabasi A -L, 2002, "Statistical Mechanics of Complex Networks", Review of Modern 

Physics 74 47 

[Barabasi 1999] Barabasi A -L, 1999, "Emergence of Scaling in Random Networks", Science 286 509 

[Barthelemy et al. 2008] Barthelemy M, Flammini A, 2008, "Modeling Urban Street Patterns", Physical 
Review Letters, 100(13) 138702 

[Barthelemy 201 1] Barthelemy M, 201 1 , "Spatial Networks", Physics Reports 499 1-101 

[Batty 2005] Batty M, 2005, "Cities and Complexity. Understanding Cities with Cellular Automata, Agent- 
Based Models, and Fractals", The MIT Press Cambridge, Massachusetts London., 

[Boccaletti et al. 2006] Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang D -U, 2006, "Complex net- 
works: Structure and dynamics", Physics Reports 424 175 

[Bohn et al. 2002] Bohn S, Andreotti B, Douady S, Munzinger J, Couder Y, 2002, "A constitutive property 
of the local organization of leaf venation networks", Physical Review E 65(6) 061914 

[Buhl et al. 2006] Buhl J, Gautrais J, Reeves N, Sole R V, Valverde S, Kuntz P, Theraulaz G, 2006, "Topo- 
logical Patterns in Street Networks of Self Organized Urban Settlements" European Physical Journal 



21 



B 49 513 

[Cardillo et al. 2006] Cardillo A, Scellato S, Latora V, Porta S, 2006, "Structural properties of planar 

graphs of urban street patterns", Physical Review E 73(6) 066107 
[Chan et al. 2011] Chan SHY, Donner R V, Lammer S, 201 1, "Urban road networks - spatial networks 

with universal geometric features?", European Physical Journal B 84(4) doi:10.1140/epjb/e2011- 

10889-3 

[Clauset et al. 2009] Clauset A, Shalizi C, Newman M E J, 2009, "Power-Law Distributions in Empirical 

Data", SIAM Review 51(4) 661-703 doi:10.1137/070710111, |http : //epubs . siam. org/doi/| 

|pdf/10 . 1137/0707101111 
[Couder et al. 2002] Couder Y, Pauchard L, Allain C, Adda-Bedia M, Douady S, 2002, "The leaf venation 

as formed in a tensorial field, European Physical Journal B 28 135-138 
[Crucitti et al. 2006] Crucitti P, Latora V, Porta S, 2006, "Centrality measures in spatial networks of urban 

streets", Physical Review E 73(3) 036125 
[Euler 1736] Euler L, 1736, "Solutio problematis ad geometriam situs pertinentis", Comm. Acad. Sci. 
[Expert et al. 201 1] Expert P, Evans T S, Blondel V D, Lambiotte R, 201 1, "Uncovering space-independent 

communities in spatial networks, Proceedings of the National Academy of Sciences 108(19) 7663- 

7668 

[da F. Costa et al. 2007] da F. Costa L, Rodrigues F A, Travieso G, Villas Boas P R, 2007, "Characteriza- 
tion of complex networks: A survey of measurements", Advances In Physics 56 167 

[da F. Costa et al. 201 1] da F. Costa L, Travencolo BAN, Viana M P, Strano E, 201 1, "On the efficiency 
of transportation systems in large cities", Europhysics Letters 91(1) 18003 

[Freeman 1979] Freeman C L, 1979, "Centrality in social networks conceptual clarification", Social Net- 
works 1 215 

[Freeman 1977] Freeman C L, 1977, "A Set of Measures of Centrality Based on Betweenness", Sociometry 
40(35) 

[Garrison et al. 1962] Garrison W L, Marble D F, 1962, "The structure of transportation networks", Techi- 
cal report 

[Gastner et al. 2006] Gastner M T, Newman M E J, 2006, "The Spatial Structure of Networks", European 
Physical Journal B 49 247 

[Gastner et al. 2006] Gastner M T, Newman M E J, 2006, "Optimal design of spatial distribution net- 
works", Physical Review E 74(1) 0161 17 



22 



[Hillier et al. 1984] Hillier B, Hanson J, 1984, "The social logic of space", Cambridge University Press 
[Hillier 1999] Hillier B, 1999, "Space is the Machine: a Configurational Theory of Architecture, Cam- 
bridge University Press, Cambridge, UK. 
[Hu et al. 2008] Hu M-B, Jiang R, Wu Y-H, Wang W-X, Wu Q-S, 2008, "Urban traffic from the perspective 

of dual graph", European Physical Journal B 63 127-133 
[Jiang 2007] Jiang B, 2007, "A topological pattern of urban street networks: Universality and peculiarity", 

Physica A Statistical Mechanics and its Applications 384 647-655 
[Kalapala et al. 2006] Kalapala V, Sanwalani V, Clauset A, Moore C, 2006, "Scale invariance in road net- 
works", Physical Review E 73(2) 026130 
[Kansky et al. 1989] Kansky K, Danscoine P, 1989, "Measures of network structure", Flux 5(1) 89-121 
[Lammer et al. 2006] Lammer S, Gehlsen B, Helbing D, 2006, "Scaling laws in the spatial structure of 

urban road networks", Physica A Statistical Mechanics and its Applications 363 89-95 
[Latora et al. 2001] Latora V, Marchiori M, 2001 , "Efficient Behavior of Small-World Networks", Physical 

Review Letters 87(19) 198701 
[Masucci et al. 2009] Masucci A P, Smith D, Crooks A, Batty M, 2009, "Random planar graphs and the 

London street network", European Physical Journal B 71 259-271 
[Milo et al. 2002] Milo R, Shen-Orr S, Itzkovitz S, Kashan N, Chklovskii D, Alon U, 2002, "Network 

Motifs: Simple Building Blocks of Complex Networks", Science 298 824 
[Newman 2010] Newman M E J, 2010, "Networks - An introduction", Oxford University Press, 
[Perna et al. 2011] Perna A, Kuntz P, Douady S, 2011, "Characterization of spatial networklike patterns 

from junction geometry", Physical Review E 83(6) 066106 
[Porta et al. 2006] Porta S, Crucitti P, Latora V, 2006, "The network analysis of urban streets: a primal 

approach", Environment and Planning B: Planning and Design 33(5) 705-725 
[Porta et al. 2006] Porta S, Crucitti P, Latora V, 2006, "The network analysis of urban streets: A dual 

approach", Physica A Statistical Mechanics and its Applications 369 853-866 
[Ratti et al. 2010] Ratti C, Sobolevsky S, Calabrese F, Andris C, Reades J, Martino M, Claxton R, Strogatz 

S H, 2010, "Redrawing the Map of Great Britain from a Network of Human Interactions", PLoS 

ONE,5{\2) el 4248 

[Sabidussi 1966] Sabidussi G, 1966, "The centrality index of a graph", Psychometrika 31(4) 581-603 
[Scott 2000] Scott J, 2000, "Social Network Analysis: A Handbook - 2nd Edition", Sage, London (UK), 



23 



[Strano et al. 2012] Strano E, Nicosia V, Latora V, Porta S, Barthelemy M, 2012, "Elementary processes 
governing the evolution of road networks", Scientific Reports 2 296 

[Travencolo et al. 2008] Travencolo B A N, da F. Costa L, 2008, "Accessibility in complex networks", 
Physics Letters A 373(1) 89-95 

[Travencolo et al. 2008] Travencolo B A N, da F. Costa L, 2008, "Hierarchical spatial organization of 
geographical networks", Journal of Physics A: Mathematical and Theoretical 41(22) 224004 

[Travencolo et al. 2009] Travencolo BAN, Viana M P, da F. Costa L, 2009, "Border detection in complex 
networks", New Journal of Physics 11(6) 063019 

[Viana et al. 2011] Viana M P, da F Costa L, 2011, "Fast long-range connections in transportation net- 
works", Physics Letters A 375 1626-1629 

[Wasserman et al. 1994] Wasserman S, Faust K, 1994, "Social Network Analysis", Cambridge University 
Press 



